DOCUMENT RESUME 

ED 351 393 TM 019 232 



TITLE          Connecticut Education Evaluation and Remedial
               Assistance. Grade 4 Mastery Test Results. Summary and
               Interpretations: 1991-92.
INSTITUTION    Connecticut State Dept. of Education, Hartford.
PUB DATE       92
NOTE           151p.
PUB TYPE       Statistical Data (110) -- Reports - Descriptive (141)
EDRS PRICE     MF01/PC07 Plus Postage.
DESCRIPTORS    Achievement Gains; Criterion Referenced Tests;
               Educational Objectives; Elementary Education;
               *Elementary School Students; *Grade 4; Intermediate
               Grades; Listening; *Mastery Tests; Mathematics
               Achievement; Reading Achievement; State Programs;
               *State Surveys; Testing Programs; *Test Results;
               Writing Achievement
IDENTIFIERS    *Connecticut Mastery Testing Program



ABSTRACT 

An overview and summary are presented of the 
implementation of the Connecticut Mastery Test for grade 4. The
testing program assesses essential skills in mathematics and language 
arts including listening, reading, and writing for students in grades 
4, 6, and 8. The criterion-referenced mastery test assesses how well 
each student is performing on skills identified by content experts 
and practicing educators as important for students entering the 
fourth grade. In 1991, fourth graders mastered an average 21.2 of the 
25 mathematics objectives tested, representing no change from the 
preceding year. A total of 88.4 percent of the students scored at or 
above the remedial standard, slightly up from the preceding year. In 
language arts, there was no change from the preceding year, as fourth 
graders mastered an average of 6.3 of the 9 objectives tested. In 
writing, fourth graders averaged 4.9 on a scale of 2 to 8, slightly
down from 1990, although the number scoring above the remedial 
standard increased somewhat. In reading, fourth graders averaged 49 
units on the Degrees of Reading Power, up slightly from 1990. About 
53 percent scored at or above the reading goal, an increase from 
1990. Comparative information for 1985 through 1991 is given. Twelve 
charts present test results, and 12 appendixes provide supplemental 
information about testing and scoring. (SLD)



LEGISLATIVE BACKGROUND

In June 1984, the General Assembly of the State of Connecticut amended Section 
10-14 m-r of the Connecticut General Statutes, an act concerning Education 
Evaluation and Remedial Assistance (EERA). This law provides that:

o By May 1, 1985, each local or regional board of education shall have
developed and submitted for State Board of Education approval a new
plan of educational evaluation and remedial assistance. Each plan
had to address the following:

    o the use of student assessment results for instructional
    improvement;

    o the identification of individual students in need of remedial
    assistance in language arts/reading and mathematics;

    o the provision of remedial assistance to students with identified
    needs; and

    o the evaluation of the effectiveness of the instructional
    programs in language arts/reading and mathematics.

o The State Board of Education shall administer an annual statewide
mastery test in language arts/reading and mathematics to all fourth-,
sixth- and eighth-grade students, with the following exceptions:

    o Special Education students who are excluded by a Planning and
    Placement Team (PPT) decision;

    o students who have been enrolled in an "English as a Second
    Language" program for two years or less; or

    o students enrolled in a Bilingual Program (as defined in Section
    10-17e of the Connecticut General Statutes) for two years or
    less.

o Each student who scores below the statewide remedial standard on one
or more parts of the eighth-grade mastery examination shall be
retested. These students shall be retested annually, using the
eighth-grade mastery test, only in the deficient area(s) until such
students score at or above the statewide remedial standard(s).

o Biennially, each local or regional board of education shall submit to
the State Board of Education a report which includes indicators of
student achievement and instructional improvement.

o On a regularly scheduled basis, the State Board of Education shall
complete field assessments of the implementation of local EERA plans.

o On an annual basis, test results and low-income data shall be used to
determine the distribution of available state funds to support
remedial assistance programs.

The purpose of this report is to provide an overview and summary of the 
implementation of the fourth-grade Connecticut Mastery Test. The mastery test
assesses how well each student is performing on those skills identified by 
content experts and practicing educators as important for students entering 
fourth grade to have mastered. 



FOREWORD 



The Connecticut Mastery Test is a critical element in Connecticut's agenda to 
attain educational equity and excellence. The testing program assesses 
essential skills in mathematics and language arts, including listening,
reading and writing, for students in grades four, six and eight. Student
achievement is measured and reported in relation to specific learning 
objectives that students reasonably can be expected to have mastered by the 
end of grades three, five and seven. 

The Connecticut Mastery Test provides valuable educational information which 
can be used to improve instruction and elevate the achievement of 
Connecticut's students. The test results are reported in a manner that 
identifies how well each student is succeeding in relation to clearly defined 
and meaningful standards. It is our hope that educators throughout the state 
use the results as a tool to gain better understanding of the learning 
occurring in our classrooms and the ways to increase learning in the future. 

Connecticut is committed to an annual cycle of assessment in order to promote: 

o the monitoring of individual student achievement;

o the evaluation of instructional program effectiveness;

o educational goal setting; and

o remedial assistance program improvement.

An examination of the results since 1985 reveals many signs of steady, 
incremental improvement. The general improvement since the start of the 
program is quite impressive in some areas. Yet the many Connecticut educators 
who helped to build the program had the foresight to include some very 
demanding content and standards. Student performance in relation to these 
expectations reveals that much remains to be done. 

As you examine these results, it is our hope that the many stories they tell 
will prove useful and informative. Department staff are available to 
facilitate the interpretations and application of these test scores. 




Peter Behuniak 
Acting Chief 

Bureau of Evaluation and Student Assessment



OVERVIEW OF THE MASTERY TESTING PROGRAM



In the spring of 1984, the Connecticut General Assembly amended the Education 
Evaluation and Remedial Assistance (EERA) legislation to authorize the 
creation of mastery tests in the basic skill areas of mathematics and language 
arts, including listening, reading and writing skills. The tests were to be 
established for grades four, six and eight. 

The goals of the mastery testing program are: 

o earlier identification of students needing remedial education;

o testing a more comprehensive range of academic skills;

o setting high expectations and standards for student achievement;

o more useful test achievement information about students, schools and
districts;

o improved assessment of suitable equal educational opportunities; and

o continual monitoring of students in grades four, six and eight.

The type of test that best addresses these goals is a criterion-referenced
test. Criterion-referenced tests are designed to assess the specific skill 
levels of students. Such tests usually cover relatively small units of 
content. Their scores have meaning in terms of what each student knows or can 
do. Test results are used to identify the areas of strengths and weaknesses 
of each student. 

MASTERY TEST CONTENT 

The CMT is designed to assess essential language arts/reading, writing and 
mathematics skills that can reasonably be expected to be mastered by most 
students by the end of the third, fifth and seventh grades. The specific 
skills to be tested within these content areas were identified by committees 
of educators from throughout the state. In addition, surveys were sent to 
many teachers, administrators and parents to determine the appropriateness of
these skills for the Mastery Test. A complete description of the procedures 
used in the development of the fourth-grade CMT can be found in Appendix A 
(p. 31). 

Mathematics

The Mathematics Advisory Committee recommended a grade four mathematics test 
that assessed twenty-five (25) specific objectives in four domains: 
(1) Conceptual Understanding; (2) Computational Skills; (3) Problem 
Solving/Applications; and (4) Measurement/Geometry. There are four test items 
per objective for a total of 100 items on the mathematics test. A detailed 
list of domains and objectives is given in Appendix B (p. 35). 

Language Arts 

The Language Arts Advisory Committee recommended a 103-item grade four 
language arts test that covers two domains: Reading/Listening and 
Writing/Locating Information. Nine (9) objectives were recommended by the 
Language Arts Advisory Committee. 

The general content of Reading/Listening consisted of narrative, expository
and persuasive passages on a variety of topics measuring a student's ability 
in: (1) Literal Comprehension; (2) Inferential Comprehension; and (3) 
Evaluative Comprehension. Audiotapes were used to assess students' listening 
comprehension ability in: (1) Literal Comprehension and (2) Inferential and 
Evaluative Comprehension. The Degrees of Reading Power (DRP) test was also 
used to assess reading. The DRP test included eight (8) passages and 
fifty-six (56) test items. It was designed to measure a student's ability to 
understand nonfiction English prose at different levels of reading difficulty. 

The general content area of Writing/Locating Information consisted of three 
components. First, there was a writing sample for direct, holistic assessment 
of student writing. Each student was asked to write a composition on a 
designated topic. Writing was then judged on a student's demonstrated ability 
to convey information in a coherent and organized fashion. Second, the 
mechanics of good writing, which was defined as (1) Capitalization and 
Punctuation, (2) Spelling, Homonyms and Abbreviations and (3) Agreement, was
assessed in a multiple-choice format. Third, Locating Information (Schedules, 
Maps, Index and Reference Use and Dictionary Meaning), measured students' 
ability to find and use information from the sources listed. A detailed list 
with objectives and number of items per objective is given in Appendix C 
(p. 37). 



FUTURE DEVELOPMENT 

The Connecticut State Department of Education (CSDE), in conjunction with 
content consultants and various CMT advisory committees, has begun the 
development of the second generation of the CMT. The current CMT is under 
review to determine which skills are appropriate for inclusion on the new 
test. In addition, new content areas and other forms of assessment techniques 
(e.g., performance assessment and short-answer questions) are being
considered. It is anticipated that the second generation CMT will be 
administered for the first time statewide in the fall of 1993. Items for this 
set of exams were piloted in the fall of 1991 and will be followed by a second 
pilot in the fall of 1992. 



SETTING MASTERY STANDARDS BY OBJECTIVE 

The essence of the Connecticut Mastery Test (CMT) is the establishment of a 
specific mastery standard against which each student's knowledge and 
competency on each objective can be compared. The mastery test incorporates
appropriate and challenging expectations for Connecticut public school 
students. The goal of the CMT Program is for each student to achieve mastery 
of all objectives. The objectives being tested were identified as appropriate 
and reasonable for students at each of the grades tested. These tests are 
designed to measure a student's performance on these specific objectives. 

The process of establishing the mastery standards by objective used a
statistical method that required two decisions. The first
decision defined a student who mastered a particular skill as one who had a
95% chance of correctly answering each item within the objective. The second
decision was that the specific standard for each objective would identify 99%
of the students who mastered the skill. By applying the two decision rules
stated above to a binomial distribution table, mastery standards were
established for the 25 mathematics objectives and the 9 language arts
objectives.

The mastery standards are as follows: 

o In mathematics, for each of the 25 objectives, a student must answer
correctly at least 3 out of 4 items.

o In language arts, for the 9 multiple-choice objectives with varying
numbers of items, a student must answer correctly the following
numbers of items:

                                                # Items Correct
                                                  for Mastery

WRITING MECHANICS
  (1) Capitalization & Punctuation                9 out of 12
  (2) Spelling                                    7 out of 9
  (3) Agreement                                  11 out of 15

LOCATING INFORMATION
  (4) Schedules, Maps, Table of Contents,
      Title Page and Dictionary                   8 out of 11

LISTENING COMPREHENSION
  (5) Literal                                     5 out of 7
  (6) Inferential and Evaluative                  9 out of 13

READING COMPREHENSION
  (7) Literal                                     9 out of 12
  (8) Inferential                                10 out of 14
  (9) Evaluative                                  7 out of 10


No mastery standards were set for the two holistic language arts measures, 
neither the Degrees of Reading Power (DRP) test nor the Writing Sample, since 
these measures are not composed of objectives on which mastery could be 
assessed. 
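
The two decision rules behind the mastery standards can be explored numerically. The sketch below is illustrative only: the report does not give the exact table lookup or rounding that was used, and the function name is hypothetical. It computes the binomial probability that a student with a 95% chance on each item meets a given standard. For the mathematics standard of 3 out of 4, that probability comes to about 0.986.

```python
from math import comb

def prob_meets_standard(n_items, n_required, p_item=0.95):
    """Probability that a student who answers each item correctly with
    probability p_item gets at least n_required of n_items right
    (upper tail of a binomial distribution)."""
    return sum(
        comb(n_items, k) * p_item**k * (1 - p_item) ** (n_items - k)
        for k in range(n_required, n_items + 1)
    )

# A few standards stated in this report: (items required, items on the objective).
standards = {
    "Mathematics (each objective)": (3, 4),
    "Capitalization & Punctuation": (9, 12),
    "Spelling": (7, 9),
}

for name, (required, total) in standards.items():
    print(f"{name}: {prob_meets_standard(total, required):.3f}")
```

A value near 0.99 means that nearly all students who have truly mastered the skill would be expected to meet the standard.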



SETTING REMEDIAL (GRANT) STANDARDS 

In addition to mastery standards, Section 10-14 m-r of the Connecticut General
Statutes requires that the Connecticut State Board of Education establish 
statewide standards for remedial assistance in order to meet two 
responsibilities: 

o to identify and monitor the progress of students in need of remedial
assistance in language arts/reading and mathematics as part of the
EERA field assessments; and

o to distribute EERA funds based on the number of needy students
statewide, as well as for use in the Chapter 2 and Priority School
District Grants.

Students who score below the remedial standard(s) are eligible for services 
provided for in EERA legislation. Remedial standards were established by the 
State Board of Education acting on the recommendations of committees that 
represented Connecticut citizens and educators. The standard-setting 
committees recommended the following remedial standards: 

1. In mathematics, a student who answers fewer than 69 of the 100 items
(69%) correctly is required to receive further diagnosis by the local
school district and, if necessary, to be provided with remedial
assistance.

2. In reading, a student whose Degrees of Reading Power (DRP) unit score
is lower than 41 is required to receive further diagnosis and, if
necessary, to be provided with remedial assistance.

3. In writing, a student receiving a total holistic score less than 4 is
required to receive further diagnosis by the local school district
and, if necessary, to be provided with remedial assistance.

The mastery and remedial standards were established by the State Board of
Education on June 23, 1985. For a detailed explanation of the remedial
standard-setting process, see Appendix D (p. 39).



STATEWIDE ACHIEVEMENT GOALS 

In addition to mastery and remedial standards, statewide achievement goals 
have been established in the content areas of mathematics, reading (DRP) and 
writing. These goals represent high expectations and high levels of 
achievement for Connecticut public school students. 

The achievement goals are as follows: 

o In mathematics, all students must master 22 of 25 objectives tested.

o In reading, a student must score a Degree of Reading Power (DRP) unit
score of 50 with 70% comprehension.

o In writing, a student must score a total holistic score of 7 on a
scale of 2 to 8.
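
The remedial standards and achievement goals stated in this report can be read as simple threshold checks on one student's scores. The sketch below is a hypothetical illustration, not the scoring contractor's procedure: the class and field names are invented, and it assumes the reported DRP unit score already reflects the comprehension level named in the reading goal. Note that the mathematics remedial standard counts items correct, while the mathematics goal counts objectives mastered.

```python
from dataclasses import dataclass

@dataclass
class StudentResult:
    math_items_correct: int        # out of 100 items
    math_objectives_mastered: int  # out of 25 objectives
    drp_units: int                 # Degrees of Reading Power unit score
    writing_holistic: int          # total holistic score, 2 to 8

def below_remedial_standard(r: StudentResult) -> dict:
    """True where the score falls below the remedial standard."""
    return {
        "mathematics": r.math_items_correct < 69,
        "reading": r.drp_units < 41,
        "writing": r.writing_holistic < 4,
    }

def meets_goal(r: StudentResult) -> dict:
    """True where the score is at or above the statewide achievement goal."""
    return {
        "mathematics": r.math_objectives_mastered >= 22,
        "reading": r.drp_units >= 50,
        "writing": r.writing_holistic >= 7,
    }

# Hypothetical student record for illustration.
student = StudentResult(math_items_correct=68, math_objectives_mastered=22,
                        drp_units=41, writing_holistic=7)
print(below_remedial_standard(student))
print(meets_goal(student))
```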



STUDENT GROWTH OVER TIME 

The Connecticut Mastery Test (CMT) Program is designed to provide 
criterion-referenced information about the level of student mastery of 
objectives in grades four, six and eight. However, the basic scores reported
for the mastery tests do not provide a system for evaluating achievement
growth from grade four to grade six to grade eight. This is so because
mastery decisions are based on student performance (mastery/non-mastery) on
objectives that are unique to each grade level. Mastery of objectives cannot be
compared directly across grade levels and tests because of the differences in
the number of objectives, curriculum content and levels of difficulty. In
order to make valid interpretations across grade levels, the mastery test
performance must first be linked using a procedure called vertical equating.



Purpose of Vertical Equating 

Vertical equating is a psychometric technique for comparing tests at all 
ability levels. This is accomplished by putting them on a new scale which is 
common to the tests. Vertical equating is based on two assumptions. The 
first is that learning is continuous. The second is that instruction in each 
area is related to increased achievement in that area. These assumptions 
enable test developers to create a score scale that covers a wide range of 
content over several grades. The development of these "growth scales" is a 
common practice and has been used successfully in the development of a variety 
of achievement test batteries. The purpose of vertical equating is to provide 
one scale score system which can be used to compare performance across 
multiple grade levels. This score system enables test users to interpret test 
score information over time without altering the basic nature of the testing 
program. Thus, achievement growth can be monitored over time on the basis of
student performance on the CMT across grades. 



Development of Vertical Scales 

In order to develop a vertical scale, performance on the grade four, grade six 
and grade eight mastery tests was statistically linked. This was accomplished 
during the 1987 administration of the CMT using representative statewide 
samples of approximately 5,000 sixth-grade students and approximately 7,000 
eighth-grade students. Each group of students at grade six and grade eight 
was administered the appropriate on-grade level test form of the CMT along 
with one below-grade level section of the CMT. Specifically, each group of 
eighth-grade students took the grade eight test as usual and a part of the 
grade six test. Likewise, each sixth-grade group took the grade six test as 
usual along with a section of the grade four test. Each sample of students 
took only one below-level section of the CMT involving approximately one hour 
of additional testing time. Performance on the below-level items was not 
counted toward the CMT scores of individual students. For each of these 
linking samples, item difficulty estimates were obtained for the on-grade and 
below-grade level items by analyzing all items together as one test. Once 
items from the on-grade and below-grade level tests were linked, item 
difficulties from each level of the CMT were adjusted to a common metric to 
produce the vertical scale. 
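
The idea of adjusting item difficulties to a common metric through shared items can be illustrated with a simple mean-shift link. This is a sketch under stated assumptions, not the procedure used for the CMT: the report does not name the measurement model or the adjustment formula, and the item labels, difficulty values and function names below are all hypothetical.

```python
from statistics import mean

def linking_shift(base_run, new_run):
    """Constant that moves difficulties from new_run onto the scale of
    base_run, estimated from the items that appear in both runs."""
    shared = base_run.keys() & new_run.keys()
    if not shared:
        raise ValueError("the two runs have no items in common")
    return mean(base_run[item] - new_run[item] for item in shared)

def to_base_scale(difficulties, shift):
    """Apply the linking constant to every item difficulty."""
    return {item: b + shift for item, b in difficulties.items()}

# Hypothetical difficulty estimates. Items g4_b and g4_c are grade four items
# that were also given to the sixth-grade linking sample.
grade4_run = {"g4_a": -0.50, "g4_b": 0.25, "g4_c": 0.75}
grade6_run = {"g4_b": -1.25, "g4_c": -0.75, "g6_x": 0.50}

shift = linking_shift(grade4_run, grade6_run)
grade6_on_grade4_scale = to_base_scale(grade6_run, shift)
print(shift, grade6_on_grade4_scale["g6_x"])
```

Repeating the same step between grade six and grade eight would chain all three levels onto one scale.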

Vertical scales were established in the content areas of mathematics and the 
reading comprehension section of the language arts test. For each grade and 
content area, every correct score corresponds to a specific value on a common 
score scale (vertical scale). Each of the vertical scales was constructed so 
that each scale score point represents the same theoretical achievement level 
whether derived from a score on the grade four test, a score on the grade six
test or a score on the grade eight test. This allows valid interpretations of
growth across time using tests differing in content, length and item
difficulty. All items on the mathematics and reading comprehension tests were
used in the development of the vertical scales. The writing and language arts
tests were not scaled because of the nature of these assessment processes.
The Degrees of Reading Power (DRP) test employs DRP unit scores which are
already on a common scale across grades, obviating the need for any other
development. (For more information see Congero, 1989, The Development
of Vertical Scales to Enhance the Evaluation of Assessment Data. Paper
presented at the annual conference of the National Council of Measurement in
Education, San Francisco, CA. This paper is available through the Student
Assessment and Testing Unit of the Bureau of Evaluation and Student
Assessment.)

Scaled scores can be used to measure growth over time because CMT scores from 
all three grade levels have been placed on a common scale. These scales 
provide a means of monitoring students' academic progress from grade to 
grade. Before the scales were developed, it was difficult to assess the 
performance of groups of test takers as they moved from grade to grade because 
of differences in test length, curriculum content covered and levels of 
difficulty on the fourth-, sixth- and eighth-grade tests.

Since students who took the fourth-grade test in 1988 subsequently took the 
sixth-grade test in 1990, change in test performance can be assessed across 
two years' time. Similarly, change in performance can be assessed for 1991 
sixth graders who took the grade four test in 1989. A summary of the overall 
growth in performance for these two groups of students in the content areas of 
mathematics and reading comprehension can be found in the 1991-92 Grade 6 
Summary and Interpretations Manual. Students who took the fourth-grade test 
in 1986 subsequently took the sixth-grade test in 1988 and the eighth-grade 
test in 1990. Similarly, students who took the fourth-grade test in 1987 
subsequently took the sixth-grade test in 1989 and the eighth-grade test in 
1991. A summary of the overall growth in performance for these groups of 
students in the content areas of mathematics and reading comprehension can be 
found in the 1991-92 Grade 8 Summary and Interpretations Manual. 



NORMATIVE INFORMATION 

The CMT Program is designed to provide detailed information about fourth-, 
sixth- and eighth-grade students' mastery of specific skills and objectives. 
The provision of national norms with CMT results is intended to enhance the 
usefulness and flexibility of mastery test information by offering a bridge to 
conventional norm-referenced testing programs. The decision to provide 
normative information with the CMT does not change the essential purposes of 
our criterion-referenced testing program. The CMT will continue to be used 
for diagnostic and other instructional purposes with results reported at the 
student, classroom, school, district and state levels. 

In particular, national norms provide greater: 

o Test Economy. By providing national norms with CMT results, school
districts can eliminate their standardized testing programs at these
grades, thus saving money and undue testing time while retaining
normative data.

o Test Efficiency. Federal compensatory programs require the
systematic testing of students using instruments that can provide
normative information. Because norms are provided with the CMT,
school districts will not have to "double test" compensatory program
students. This service allows for increased instructional time for
these students.

o Test Interpretability. Criterion-referenced test (CRT) programs
may be criticized because the public has difficulty interpreting CRT
performance. National norms will assist in the interpretation of CMT
performance by providing a traditional benchmark with which the
public is familiar.



Development of Norms 

In order to provide estimated national norm-referenced data based on CMT 
performance, items on the CMT were statistically linked to items on a 
nationally norm-referenced test (NRT). Content-appropriate items from a 
nationally normed host test were included on the CMT to provide a common 
referent to both tests. Test equating procedures were then used to link CMT 
items with the normed test by placing all the items on a common scale. With 
this linkage in place, estimates of how the performance of Connecticut 
students compares to a national sample could be made. The NRT used to 
accomplish this task was the sixth edition of the Metropolitan Achievement 
Test (MAT-6), normed in 1986. The equating of the CMT to the MAT-6 enabled
group summary scores on the CMT to be interpreted relative to the MAT-6
nationally representative normative data.

The CMT was initially equated to the MAT-6 during the pilot testing phase to 
investigate the relationship of the test content and material between the two 
tests and the differential nature of the items included on the CMT and MAT-6. 
In addition, these preliminary data provided a benchmark by which the 
stability of the link could be monitored over time. The stability issue is 
monitored each year by readministering MAT-6 items during CMT administrations 
using representative statewide samples. The comparison of these data with 
prior information provides the information necessary to identify the 
instructional effects on student performance over time and to update the 
CMT/MAT-6 link as appropriate. This monitoring and updating ensures the 
continued accuracy of the normative estimates. 



RESEARCH OPTIONS PROGRAM 

The Research Options Program is a free service provided by the Connecticut 
State Department of Education (CSDE) to help educators and educational 
policymakers gain access to the extensive information available from the 
Connecticut Mastery Test (CMT). Participation in the Research Options Program 
is completely voluntary. 

The Research Options Program allows educators and educational policymakers 
(i.e., superintendents, principals, researchers, evaluators and school board 
members) to benefit from customized research investigations designed to suit 
their individual needs or questions. Many school districts have taken 
advantage of the Research Options Program in previous years to successfully 
address special local concerns.

The Research Options Program provides a number of ways of examining student
achievement, as measured by the CMT. For example, one method is to compare
aggregated student test scores obtained from the CMT in two or more categories 
of interest. Categories might include males and females, special program 
students compared to non-special program students, or any other comparison. 
These reports include tables that show the proportion of students mastering 
each objective, average number of objectives mastered and the achievement 
indicators for students on each component of the test under consideration. 
These breakdowns allow district personnel to directly compare the performance 
of specific groups of students. In addition, graphics are provided, as 
appropriate, with each report in order to simplify the task of interpreting 
data. 
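
A group comparison of the kind described above can be sketched as follows. This is a hypothetical illustration, not the Research Options Program's actual software: the function name, record layout and sample data are invented. For each group it reports the proportion of students mastering each objective and the average number of objectives mastered.

```python
from collections import defaultdict

def summarize_by_group(records):
    """records: iterable of (group_label, mastery) pairs, where mastery is a
    list of booleans with one entry per objective."""
    grouped = defaultdict(list)
    for group, mastery in records:
        grouped[group].append(mastery)

    summary = {}
    for group, rows in grouped.items():
        n_students = len(rows)
        n_objectives = len(rows[0])
        summary[group] = {
            "students": n_students,
            # Share of the group mastering each objective, in objective order.
            "proportion_mastering": [
                sum(row[j] for row in rows) / n_students
                for j in range(n_objectives)
            ],
            "mean_objectives_mastered": sum(sum(row) for row in rows) / n_students,
        }
    return summary

# Hypothetical data: three objectives and two comparison groups.
records = [
    ("program", [True, True, False]),
    ("program", [True, False, False]),
    ("non-program", [True, True, True]),
    ("non-program", [False, True, True]),
]
report = summarize_by_group(records)
for group, stats in report.items():
    print(group, stats)
```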

The Research Options component of the CMT has grown a great deal since the 
first study was performed on the Connecticut Basic Skills Proficiency Test 
almost a decade ago. This year, test directors and evaluators in 26 districts 
took advantage of this valuable resource to address questions of local 
interest. In addition, statewide programs such as Bilingual Evaluation,
Chapter I and School Effectiveness have used the research options to obtain
useful information for participants in over 100 districts. [For more
information see Mooney, R.F., 1989, The Connecticut Mastery Test Research
Options Program: The Application of State Criterion-Referenced Test Reports
for Local Research Needs. Paper presented at the annual conference of the
National Council of Measurement in Education, San Francisco, CA. See also the
Research Options Handbook (1988) provided by the Connecticut State Department
of Education. (These references are available through the Student Assessment
Unit of the Bureau of Evaluation and Student Assessment.)]



TEST ADMINISTRATION AND SCORING 

The regular administration of the Connecticut Mastery Test (CMT) for 1991 was 
conducted using Form D during a three-week period commencing on September 23,
1991. Test sessions were conducted by local school district staff under the
supervision of local test coordinators who had been trained by staff of the 
Connecticut State Department of Education (CSDE) and The Psychological 
Corporation (TPC). A student who took all subtests participated in 
approximately six and one-half hours of testing. 

The Grade 4 Connecticut Mastery Test had seven testing sessions. 

Mathematics I (60 minutes) 

Mathematics II (60 minutes) 

Writing Sample (45 minutes) 

Degrees of Reading Power (60 minutes) 

Reading Comprehension (60 minutes) 

Listening Comprehension (45 minutes) 

Writing Mechanics/Locating Information (60 minutes) 
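
As a quick arithmetic check, the session lengths listed above can be totaled and compared with the approximate testing time cited for a student who took all subtests. The variable names are illustrative.

```python
# Session lengths in minutes, as listed in this report.
sessions = {
    "Mathematics I": 60,
    "Mathematics II": 60,
    "Writing Sample": 45,
    "Degrees of Reading Power": 60,
    "Reading Comprehension": 60,
    "Listening Comprehension": 45,
    "Writing Mechanics/Locating Information": 60,
}

total_minutes = sum(sessions.values())
hours, minutes = divmod(total_minutes, 60)
print(f"{len(sessions)} sessions, {hours} hours {minutes} minutes")
```

The total is 390 minutes, which agrees with the six and one-half hours stated in the text.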

At the conclusion of the make-up testing period, answer booklets were returned
to TPC in San Antonio, Texas, for optical scanning and scoring, and then
organized in preparation for holistic scoring workshops.

Scoring of the Language Arts and Mathematics Tests



The mathematics and language arts multiple-choice tests were machine-scored by
TPC. Mathematics scores were reported for the total test as well as for
mastery by each objective. Language arts scores were reported for mastery of
each objective only.



Scoring of the Writing Sample 

Every writing sample was scored by Connecticut educators using a technique
known as the holistic scoring method. Holistic scoring is an impressionistic
and quick scoring process that rates written products on the basis of their
overall quality. It relies upon the scorers' trained understanding of the
general features that determine distinct levels of achievement on a scale
appropriate to the group of writing pieces being evaluated. All participants
received on-site training and were required to demonstrate a clear
understanding of the scoring criteria prior to actually scoring student
essays. Each paper receives a final score between 2 and 8, where 2 represents
a poor paper and 8 represents a superior paper. A thorough description of the
training and scoring process, including sample papers representing different
holistic scores, is presented in Appendix E (p. 45).



Analytic Scoring 

All papers receiving holistic scores at or below the remedial standard of 4
also received analytic scoring in four categories (traits): focus,
organization, support/elaboration and conventions. Analytic scoring is a
thorough, trait-by-trait analysis of those components of a writing sample that
are considered important to any piece of writing in any context. This scoring
procedure can provide a comprehensive picture of a student's writing
performance if enough traits are analyzed. It can identify those traits that
make a piece of writing effective or ineffective. However, the traits need to
be explicit and well defined so that the raters understand and agree upon the
basis for making judgments about the writing sample. The analytic rating
guide and sample marker papers for the analytic scoring are presented in
Appendix F (p. 57).



Scoring of the Degrees of Reading Power (DRP) Test 

The DRP multiple-choice test was machine-scored by TPC. The scores reported 
are in DRP units. These scores identify the difficulty or readability level 
of prose that a student can comprehend. This makes it possible to match the 
difficulty of written materials with student ability. These scores can be 
better interpreted by referring to the readability levels of some general 
reading materials as shown below: 

o Elementary textbooks (grades 3-5) - 35-58 DRP Units 

o Fiction Section - children's magazines - 48 DRP Units 

A much more extensive list of reading materials is contained and rated in the 
Readability Report, Seventh Edition, published by The College Board. 

The conversion between DRP unit scores and raw scores can be made from the 
tabled values obtainable through the Student Assessment and Testing Unit of 
the Bureau of Evaluation and Student Assessment. 



SCHOOL DISTRICT TEST RESULTS REPORTING 

The CMT school district reports are designed to provide useful and 
comprehensive test achievement information about districts, schools and 
students. Four standard test reports are generated to assist superintendents, 
principals, teachers, parents and students to understand and use 
criterion-referenced test results. Appendix G (p. 61) presents samples of the 
district, school, class and parent/student diagnostic score reports. 



FALL 1991 STATEWIDE TEST RESULTS 

The Grade 4 Connecticut Mastery Test provides a comprehensive evaluation of 
student performance on specific skills that Connecticut educators feel are 
important at the beginning of fourth grade. The mastery test's greatest 
instructional utility lies in its identification of areas of student weakness 
and strength. This report profiles the statewide results. Each school 
district also receives a full complement of reports that identify patterns of 
academic strength and weakness at the district, school, classroom and 
individual student levels. 

Chart 1 (p. 12) gives a statewide summary of the average number of objectives 
mastered (mathematics and language arts), average writing and reading scores, 
the number of students scored, the number of students scoring at or above the 
remedial standard and goal (where applicable) and the percent of students 
scoring at or above the remedial standard and goal (where applicable). 

The following are highlights of the 1991 Grade 4 CMT results: 

MATHEMATICS 

o Fourth graders mastered an average of 21.2 of the 25 objectives 
tested, representing no change from last year. 

o A total of 88.4% of the students scored at or above the remedial 
standard, up slightly from last year's figure of 88.3%. 

o A total of 62.3% of the students scored at or above the mathematics 
goal, an increase from last year's figure of 61%. 

LANGUAGE ARTS 

o Fourth graders mastered an average of 6.3 of the 9 objectives tested, 
representing no change from last year. 

WRITING 

o Fourth graders averaged 4.9 on a scale of 2 to 8, down slightly from 
last year's 5.1. 

o A total of 89.0% of the students scored at or above the remedial 
standard, an increase from last year's figure of 87.8%. 

o A total of 13.9% of the students scored at or above the writing goal, 
down from last year's figure of 18%. 

READING 

o Fourth graders averaged 49 units on the Degrees of Reading Power 
(DRP) test, up slightly from last year's average of 48 units. 

o A total of 76% of the students scored at or above the remedial 
standard, an increase from last year's figure of 72.9%. 

o A total of 52.8% of the students scored at or above the reading goal, 
an increase from last year's figure of 49%. 
CHART 1 

1991 CONNECTICUT MASTERY TEST RESULTS 
GRADE 4 STATEWIDE SUMMARY 

                                NUMBER OF   STUDENTS AT OR ABOVE   STUDENTS AT OR ABOVE 
                                STUDENTS    REMEDIAL STANDARD*     STATE GOAL** 
SUBJECT             AVERAGE     SCORED      NUMBER     PERCENT     NUMBER     PERCENT 

AVERAGE NUMBER OF OBJECTIVES MASTERED 
MATHEMATICS          21.2       35,457      31,332      88.4%      22,073      62.3% 
LANGUAGE ARTS         6.3       35,067 

AVERAGE HOLISTIC SCORE 
WRITING SAMPLE        4.9       34,877      31,026      89.0%       4,848      13.9% 

AVERAGE DRP UNIT SCORE 
READING                49       35,312      26,843      76.0%      18,632      52.8% 

 * MATHEMATICS REMEDIAL STANDARD = 69 ITEMS CORRECT 
   WRITING REMEDIAL STANDARD = 4 
   READING REMEDIAL STANDARD = 41 DRP UNITS 

** MATHEMATICS GOAL = 22 OBJECTIVES MASTERED 
   WRITING GOAL = 7 
   READING GOAL = 50 DRP UNITS 
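
The percentages in Chart 1 follow from the counts in the same rows. The short Python sketch below recomputes them as a consistency check. It assumes the scanned counts are whole numbers with thousands separators (for example 4,848 and 35,312); the names `CHART_1` and `percent_at_or_above` are illustrative and do not come from the report.

```python
# Counts transcribed from Chart 1 (assumed to be thousands-separated integers).
CHART_1 = {
    "mathematics": {"scored": 35457, "remedial": 31332, "goal": 22073},
    "writing": {"scored": 34877, "remedial": 31026, "goal": 4848},
    "reading": {"scored": 35312, "remedial": 26843, "goal": 18632},
}


def percent_at_or_above(count: int, scored: int) -> float:
    """Percent of scored students at or above a standard, to one decimal."""
    return round(100 * count / scored, 1)


for subject, row in CHART_1.items():
    remedial = percent_at_or_above(row["remedial"], row["scored"])
    goal = percent_at_or_above(row["goal"], row["scored"])
    print(f"{subject}: remedial {remedial}%, goal {goal}%")
```

Under that reading of the counts, the recomputed values agree with the percentages printed in the chart.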
Mathematics 

In mathematics, fourth graders mastered an average of 21.2 objectives, or 
84.8%, of the 25 objectives tested. While the state's goal is that all 
students master every objective, an interim standard (22 of 25 objectives 
mastered) has been established which represents a high level of mathematics 
achievement. Chart 2 (p. 15) illustrates that, statewide, students 
demonstrated strength (85% or more students achieving mastery) in the basic 
conceptual and computational skills and simple applications objectives of 
determining one and ten more/less than a number; addition/subtraction facts 
with and without regrouping; identifying shapes/angles/sides and objects in 
arrays; rewriting numbers using expanded notation; reading and interpreting 
graphs and tables; telling time; determining the value of a set of coins; 
identifying number sentences and needed information from problems; and solving 
story problems with addition and subtraction. However, students did not 
perform as effectively (only 50% of the students achieving mastery) on the 
objective of rewriting numbers by regrouping. This objective assesses the 
understanding of place value as well as regrouping for multi-digit computation. 

There continues to be a consistent pattern throughout the mathematics subtests 
of student strengths in primarily computational skills and easy one-step 
routine applications. These strengths are offset by a pattern of student 
weaknesses on higher order objectives. For example, students are consistently 
strong in their ability to recall number facts and compute with whole 
numbers. However, there is a weakness in regrouping and estimating. 

Students getting fewer than 69 questions correct on the 100-question 
mathematics section (11.6% of fourth grade students tested) were identified as 
needing further diagnosis and possible remedial instruction. 
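
As a rough illustration of the two mathematics figures used above, the following Python sketch applies the remedial standard (69 of 100 items) and converts an average number of objectives mastered into a percent. The function names are hypothetical; only the thresholds and averages come from the report.

```python
MATH_ITEMS = 100             # questions on the mathematics section
MATH_REMEDIAL_STANDARD = 69  # items correct needed to meet the standard


def needs_math_followup(items_correct: int) -> bool:
    """True when a score falls below the mathematics remedial standard."""
    if not 0 <= items_correct <= MATH_ITEMS:
        raise ValueError("items_correct must be between 0 and 100")
    return items_correct < MATH_REMEDIAL_STANDARD


def percent_of_objectives(mastered: float, tested: int) -> float:
    """Average objectives mastered expressed as a percent of those tested."""
    return round(100 * mastered / tested, 1)


print(needs_math_followup(68))          # below the standard
print(needs_math_followup(69))          # meets the standard
print(percent_of_objectives(21.2, 25))  # mathematics average from the report
```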



Language Arts 

In language arts, fourth-grade students averaged 6.3 objectives, or 70.0% of 
the 9 objectives tested. The state's goal is that all students master every 
objective. Chart 3 (p. 16) illustrates that students did reasonably well on 
writing mechanics, as well as locating information and literal reading 
comprehension. However, weaknesses were found in the higher order inferential 
and evaluative listening and reading comprehension objectives. These results 
indicate that students need to learn more effective comprehension strategies 
while simultaneously being exposed to a wide variety of reading selections. 

In writing, fourth-grade students averaged 4.9 points on a scale of 2 
through 8. The state's goal is that all students be able to produce an 
organized, well-supported piece of writing, that is, a holistic score of 7 
or 8. Chart 4 (p. 17) illustrates that 14% of the students produced an 
organized, well-supported piece of writing (scores of 7 or 8), and an 
additional 44% produced a paper which is generally well organized (scores of 5 
or 6). A total of 31% of the students scored a 4, which indicates minimally 
proficient writing, while the remaining 11% scored below the remedial standard 
(scores of 2 or 3). 

In reading (Degrees of Reading Power test), fourth-grade students averaged 49 
units on a scale of 15 through 84. The state's goal is that all students be 
able to read with high comprehension those materials typically used at the 
fourth grade or above; that is, at least 50 on the DRP unit scale. Chart 5 
(p. 18) illustrates that 53% of the students scored at least 50 on the DRP 
score scale, 23% scored between 41 and 49 and 24% scored below the remedial 
standard of 41. The average score of 49 suggests that Connecticut fourth 
graders typically can read and comprehend expository materials normally used 
up to grade four. These results indicate that students will probably benefit 
from continued exposure to nonfiction materials in the primary grades. 
CHART 4 
WRITING SAMPLE: 
PERCENT OF STUDENTS AT EACH SCORE POINT 

This bar chart illustrates the distribution of students who received each holistic writing 
score, statewide. Holistic writing scores are interpreted as follows: a student who scores 7 
or 8 has produced a paper which is well written with developed supportive detail; a student 
who scores 5 or 6 has produced a paper which is generally well organized with supportive 
detail; a student who scores 4 is minimally proficient; and a student who scores 2 or 3 is in 
need of further diagnosis and possible remedial assistance. 
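
The score bands described for Chart 4 can be sketched as a small lookup. This is a minimal Python illustration; the function name `interpret_holistic` is an assumption, and the band descriptions are paraphrased from the chart note above.

```python
def interpret_holistic(score: int) -> str:
    """Map a holistic writing score (2 through 8) to its reporting band."""
    if score not in range(2, 9):
        raise ValueError("holistic scores run from 2 through 8")
    if score >= 7:
        return "well written with developed supportive detail"
    if score >= 5:
        return "generally well organized with supportive detail"
    if score == 4:
        return "minimally proficient"
    return "in need of further diagnosis and possible remedial assistance"


for s in range(2, 9):
    print(s, interpret_holistic(s))
```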
CHART 5 

DEGREES OF READING POWER (DRP): 
PERCENT OF STUDENTS AT SELECTED RANGES OF 
DRP UNIT SCORES 

This bar chart illustrates the distribution of students, statewide, scoring in each of three 
Degrees of Reading Power (DRP) score categories. DRP score categories are interpreted 
as follows: a student who scores 50 DRP units or above has met the statewide Reading 
Goal and can read, with high comprehension, materials which are typically used at grade 4 
or above; a student who scores 41-49 DRP units can read, with high comprehension, 
materials which are typically used below grade 4 but above the Remedial Standard; and a 
student who scores 40 DRP units or below is in need of further diagnosis and possible 
remedial assistance. 
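
The three DRP categories can be expressed the same way. The sketch below is illustrative only: the cut points (50 and 41 DRP units) are the report's, while the function name and the short category labels are assumptions.

```python
DRP_GOAL = 50               # statewide reading goal, in DRP units
DRP_REMEDIAL_STANDARD = 41  # reading remedial standard, in DRP units


def interpret_drp(units: int) -> str:
    """Place a DRP unit score into one of the three reporting categories."""
    if units >= DRP_GOAL:
        return "at or above goal"
    if units >= DRP_REMEDIAL_STANDARD:
        return "below goal, at or above remedial standard"
    return "below remedial standard"


print(interpret_drp(49))  # the statewide average falls in the middle band
```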
COMPARISON OF 1985 THROUGH 1991 TEST RESULTS 



Charts 6-12 (pp. 21-27) address the comparison of the 1985 through 1991 test 
results. Charts 6 (p. 21), 9 (p. 24) and 10 (p. 25) present a comparison of 
statewide average scores on the four subtests, a comparison of the percent of 
students scoring at or above the remedial standard and a comparison of the 
percent of students scoring at or above the statewide goals, respectively. 
The remaining four charts provide a comparison of the percent of students 
achieving mastery in each mathematics objective (Chart 7, p. 22) and each 
language arts objective (Chart 8, p. 23), a comparison of student achievement 
in relation to the remedial standards (Chart 11, p. 26), and a comparison of 
student achievement in relation to the goals (Chart 12, p. 27). 

Chart 6 (p. 21) shows that the statewide average scores increased in all areas 
tested when 1991 results are compared to 1985 results. In mathematics, the 
average number of objectives mastered increased from 19.3 in the initial 
assessment in 1985 to 21.2 in 1991. Mathematics scores have either increased 
slightly or remained unchanged in each of the test administrations indicating 
a positive trend. DRP reading performance has also been moving slowly in a 
positive direction. While the average DRP score was unchanged from 1988 to 
1989, there has been a one point increase in each other year moving from 43 in 
1985 to 49 in 1991. The average number of language arts objectives mastered 
has increased slightly over the life of the CMT program from 6J objectives 
mastered in 1985 to 6.3 mastered in 1991. Student performance on the writing 
samples showed some progress from 1985 to 1991 with the average holistic score 
increasing from 4.8 to 4.9. 

Chart 7 (p. 22) lists the percent of students at mastery for each of the 25 
mathematics objectives. From 1985 to 1991, 24 objectives have shown a gain in 
percent of students at or above mastery and 1 has declined slightly. A 
comparison of the 1991 and 1985 results shows large gains (at least 10 
percentage points) in the percent of students meeting the mastery standard in 
the following objectives: rewriting numbers by regrouping; identifying 
fractional parts; relating multiplication/division facts to pictures; 
estimating sums and differences; reading and interpreting tables/charts; 
identifying number sentences from pictures; and estimating lengths and areas. 
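
The "large gain" rule (at least 10 percentage points from 1985 to 1991) is simple arithmetic on the first and last columns of Chart 7. The Python sketch below applies it to the 1985 and 1991 values as read from the scanned chart, keyed by objective number; the transcription and the name `large_gains` are assumptions, not part of the report.

```python
# Objective number -> (1985, 1991) percent of students at mastery,
# transcribed from Chart 7.
MASTERY = {
    1: (91, 94), 2: (72, 78), 3: (78, 84), 4: (96, 95), 5: (35, 50),
    6: (73, 84), 7: (54, 71), 8: (91, 97), 9: (95, 96), 10: (89, 92),
    11: (28, 59), 12: (79, 80), 13: (82, 87), 14: (89, 95), 15: (78, 92),
    16: (57, 79), 17: (91, 93), 18: (83, 91), 19: (73, 78), 20: (79, 87),
    21: (76, 78), 22: (70, 80), 23: (86, 91), 24: (91, 93), 25: (97, 99),
}
LARGE_GAIN = 10  # percentage points


def large_gains(mastery: dict, threshold: int = LARGE_GAIN) -> dict:
    """Return {objective: gain} for objectives gaining at least `threshold`."""
    return {
        obj: end - start
        for obj, (start, end) in mastery.items()
        if end - start >= threshold
    }


print(large_gains(MASTERY))
```

With these values the rule selects objectives 5, 6, 7, 11, 15, 16 and 22, the same seven objectives named in the paragraph above.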

Chart 8 (p. 23) lists the percent of students at mastery for each of the 9 
language arts objectives. From 1985 to 1991, 6 objectives have shown a gain 
in percent of students at or above mastery and 3 objectives have declined. 

When 1991 results are compared with 1985, inferential reading comprehension 
showed the most improvement in the percent of students at mastery with a 15 
percentage point gain. 

Chart 9 (p. 24) compares the percent of students who scored at or above the 
remedial standard in mathematics, writing and reading (DRP) for 1985 through 
1991. In each content area there has been a gain in the percent of students 
meeting the remedial standard over the seven CMT administrations. In 
mathematics, the remedial standard is 69 out of 100 items correct. There was 
an 8 percentage point increase in performance at or above the remedial 
standard from 1985 (80%) to 1991 (88%). In writing, the remedial standard is 
4 on a scale from 2 to 8. The percent of students scoring at or above the 
remedial standard increased from 81% in 1985 to 89% in 1991. In reading (DRP) 
the remedial standard is 41 DRP units with 70% comprehension. There was an 
8 percentage point increase in performance at or above the remedial standard 
from 1985 (68%) to 1991 (76%). 

Chart 10 (p. 25) compares the percent of students scoring at or above the 
statewide goals in mathematics, writing and reading from 1985 through 1991. 
In mathematics, the goal is 22 of 25 objectives mastered. There was a 20 
percentage point increase in performance at or above the statewide goal from 
1985 (42%) to 1991 (62%). In writing, the goal is 7 on a scale of 2 to 8. 
The percent of students scoring at or above the statewide goal decreased 
slightly from 17% in 1985 to 14% in 1991. In reading (DRP) the statewide goal 
is 50 DRP units with 70% comprehension. There was an 11 percentage point 
increase in performance at or above the goal from 1985 (42%) to 1991 (53%). 

Chart 11 (p. 26) is a comparison of student achievement in relation to the 
remedial standards for 1985 through 1991. Over the seven-year period, the 
percent of students at or above the remedial standard on all three tests 
(mathematics, reading, writing) has increased from 58.6% in 1985 to 68.5% in 
1991, while the percent of students below the remedial standard on all three 
tests has declined from 8.2% in 1985 to 3.5% in 1991. The percent of students 
below the remedial standard on one or more subtests has also dropped from 
40.4% in 1985 to 30.0% in 1991. 

Chart 12 (p. 27) is a comparison of student achievement in relation to the 
goals for 1985 through 1991. Over the seven-year period, there has been a 
slight increase in the percent of students reaching the statewide goal on all 
three tests (mathematics, reading, and writing), while the percent of students 
below the statewide goal on all three tests has declined from 43.2% in 1985 to 
28.0% in 1991. The percent of students above the statewide goal on one or 
more subtests has increased from 55.6% in 1985 to 69.8% in 1991. 



Test Results by District 

Appendices H and I address the comparison of test results by school district. 
Appendix H (p. 73) and Appendix I (p. 81) present a listing of the mathematics 
and language arts test results, respectively, for each Connecticut school 
district. In each appendix, school districts are listed alphabetically, 
followed by regional school districts. The Type of Community (TOC) 
designation in the second column and the Education Reference Group (ERG) 
designation in the third column indicate the TOC and ERG groups with which 
each district or school has been classified. Definitions of the TOC and ERG 
classifications are provided in Appendix J (p. 89) and Appendix K (p. 91), 
respectively. TOC and ERG summaries follow the alphabetical listings of 
school district results in mathematics and language arts. 

The State Department of Education advises against comparing scores between and 
among school districts. It is more meaningful to compare district results 
longitudinally within each district. It is also not appropriate or meaningful 
to sum across the different tests and subtests for comparative purposes 
because of differences in test length, mastery criteria and remedial 
standards. These comparisons are inappropriate because it is impossible to 
identify, solely on the basis of this information, how the average student has 
performed in the districts being compared. Average scores and standard 
deviations provide more appropriate comparative information on how well the 
average student is performing, although many factors may affect the 
comparability of these statistics as well. 
CHART 7 

MATHEMATICS: COMPARISON OF THE PERCENT OF STUDENTS 
ACHIEVING MASTERY IN EACH OBJECTIVE FOR 1985 THROUGH 1991 

                                            PERCENT OF STUDENTS AT MASTERY          PERCENTAGE POINT GAIN 
OBJECTIVE                                   1985  1986  1987  1988  1989  1990  1991   FROM 1985 TO 1991 

CONCEPTUAL UNDERSTANDINGS 
 1. DETERMINE 1 AND 10 MORE/LESS THAN #     91%   92%   93%   91%   91%   93%   94%     3% 
 2. EXTEND PATTERNS: #'S AND ATTRIBUTES     72%   75%   78%   69%   71%   77%   78%     6% 
 3. ORDER WHOLE NUMBERS                     78%   82%   84%   83%   83%   83%   84%     6% 
 4. REWRITE #'S BY EXPANDED NOTATION        96%   96%   96%   96%   96%   95%   95%    -1% 
 5. REWRITE #'S BY REGROUPING 10'S & 1'S    35%   39%   41%   45%   48%   49%   50%    15% 
 6. ID FRACTIONAL PARTS OF REGIONS/SETS     73%   85%   86%   90%   90%   83%   84%    11% 
 7. RELATE MULT/DIV FACTS TO PICTURES       54%   61%   62%   59%   60%   71%   71%    17% 

COMPUTATIONAL SKILLS 
 8. ADDITION/SUBTRACTION FACTS TO 18        91%   97%   97%   98%   98%   97%   97%     6% 
 9. ADD/SUBTRACT WITHOUT REGROUPING         95%   96%   97%   97%   97%   96%   96%     1% 
10. ADD 1- & 2-DIGIT #'S WITH REGROUPING    89%   87%   88%   84%   85%   92%   92%     3% 
11. ESTIMATE SUMS/DIFFERENCES TO 100        28%   46%   52%   49%   51%   59%   59%    31% 
12. MULTIPLY/DIVIDE BY 2, 5, 10             79%   80%   81%   78%   78%   80%   80%     1% 

PROBLEM SOLVING/APPLICATIONS 
13. IDENTIFY OBJECTS/NUMBERS IN AN ARRAY    82%   87%   88%   89%   90%   87%   87%     5% 
14. READ/INTERPRET GRAPHS/PICTOGRAPHS       89%   90%   91%   92%   93%   95%   95%     6% 
15. READ/INTERPRET TABLES/CHARTS            78%   84%   86%   90%   91%   92%   92%    14% 
16. ID NUMBER SENTENCES FROM PICTURES       57%   58%   60%   60%   62%   79%   79%    22% 
17. ID NUMBER SENTENCES FROM PROBLEMS       91%   91%   92%   93%   93%   93%   93%     2% 
18. SOLVE STORY PROBLEMS WITH               83%   76%   78%   85%   85%   91%   91%     8% 
19. SOLVE STORY PROBS WITH EXTRA INFO       73%   63%   65%   78%   79%   77%   78%     5% 
20. IDENTIFY NEEDED INFO IN PROBLEMS        79%   82%   83%   83%   83%   87%   87%     8% 

MEASUREMENT/GEOMETRY 
21. MEASURE LENGTHS/IDENTIFY UNITS          76%   79%   81%   82%   83%   78%   78%     2% 
22. ESTIMATE LENGTHS/AREAS                  70%   79%   81%   72%   72%   80%   80%    10% 
23. TELL TIME TO NEAREST 1, 1/2, 1/4 HOUR   86%   90%   91%   94%   95%   91%   91%     5% 
24. DETERMINE VALUE OF A SET OF COINS       91%   93%   94%   92%   92%   92%   93%     2% 
25. IDENTIFY SHAPES/ANGLES/SIDES            97%   97%   97%   97%   97%   99%   99%     2% 
CHART 8 

LANGUAGE ARTS: COMPARISON OF THE PERCENT OF STUDENTS 
ACHIEVING MASTERY IN EACH OBJECTIVE FOR 1985 THROUGH 1991 

                                            PERCENT OF STUDENTS AT MASTERY          PERCENTAGE POINT GAIN 
OBJECTIVE                                   1985  1986  1987  1988  1989  1990  1991   FROM 1985 TO 1991 

WRITING MECHANICS 
 1. CAPITALIZATION AND PUNCTUATION          [?]   [?]   85%   70%   72%   71%   70%    -4% 
 2. SPELLING/HOMONYMS/ABBREVIATIONS         66%   62%   62%   68%   67%   71%   70%     4% 
 3. AGREEMENT                               80%   81%   82%   84%   84%   83%   83%     3% 

LOCATING INFORMATION 
 4. SCHEDULES/MAPS/BOOKS/DICTIONARIES       81%   85%   87%   88%   89%   88%   87%     6% 

LISTENING COMPREHENSION 
 5. LITERAL                                 73%   54%   55%   68%   68%   66%   67%    -6% 
 6. INFERENTIAL/EVALUATIVE                  60%   64%   66%   74%   74%   57%   57%    -3% 

READING COMPREHENSION 
 7. LITERAL                                 67%   71%   73%   65%   66%   72%   72%     5% 
 8. INFERENTIAL                             51%   58%   60%   52%   53%   67%   66%    15% 
 9. EVALUATIVE                              55%   52%   54%   51%   52%   58%   57%     2% 
CHART 9 

COMPARISON OF THE PERCENT OF STUDENTS 
SCORING AT OR ABOVE THE REMEDIAL STANDARD 
IN EACH SUBJECT AREA FOR 1985 THROUGH 1991 

CHART 10 

COMPARISON OF THE PERCENT OF STUDENTS 
SCORING AT OR ABOVE THE GOAL 
IN EACH SUBJECT AREA FOR 1985 THROUGH 1991 

Normative Results 



Normative information is provided to indicate how well the average student in 
Connecticut performs compared to a national sample of students. Norms have 
been available for the mathematics test, the language arts test and the 
reading comprehension test since 1987. This year, for the second year, 
normative information is also being provided for mathematics problem solving. 
These norms are based on links established between the CMT and the sixth 
edition of the Metropolitan Achievement Test (MAT-6). The norms are expressed 
in percentile ranks which provide estimates of group performance relative to 
the performance of the national MAT-6 norm group. Percentile ranks range from 
1 to 99. A percentile rank of 50 represents the score that divides the norm 
group into two equal parts; half scoring below and half scoring above this 
value. Each reported percentile rank represents the performance of a 
nationally representative sample of students in relation to Connecticut 
student performance. 
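
To make the idea of a percentile rank concrete, here is a minimal Python sketch that uses one common definition: the percent of a norm group scoring below a given score, limited to the reported range of 1 to 99. The norm-group scores are invented for illustration. The report's own norms were derived through links to the MAT-6, not by this calculation.

```python
def percentile_rank(score: float, norm_scores: list) -> int:
    """Percent of the norm group scoring below `score`, clamped to 1..99."""
    below = sum(1 for s in norm_scores if s < score)
    rank = round(100 * below / len(norm_scores))
    return min(99, max(1, rank))


# Hypothetical norm group of ten scores (not taken from the report).
norm_group = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
print(percentile_rank(65, norm_group))
```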

The following are the estimated norms for the grade four statewide averages. 
In the content areas of total mathematics, language arts and reading 
comprehension (not DRP), data are provided for the 1987 through 1991 
administrations. Normative information in the content area of mathematics 
problem solving is presented for the 1990 and 1991 administrations only. 

Grade Four 

                               1987   1988   1989   1990   1991 

Total Mathematics               67     66     67     68     68 
Language Arts                   69     70     69     67     66 
Reading Comprehension           60     58     59     58     56 
Mathematics Problem Solving     --     --     --     68     69 


Patterns in the data are summarized below. 

o In each content area and administration year, the mean national 
percentile rankings of Connecticut students substantially exceed the 
national average (50th percentile rank). 

o The norms for mathematics and language arts have remained similar to 
one another over the five years with percentile ranks ranging from 66 
to 70 in value. In 1991 the reading comprehension performance 
continues to be lower than either mathematics or language arts when 
compared to a national sample. 

o The percentile ranks within each content area are quite stable across 
the five years, differing in value by no more than four points. 

It should be pointed out that these norms provide a way to interpret the 
performance of the average Connecticut student relative to a national sample. 
They do not address the issue of how Connecticut, as a state, compares to 
other states. The fact that, in 1991, the average Connecticut student is at 
the 68th percentile in mathematics does not mean that the state as a whole 
would be in the 68th percentile if it were compared to other states. A 
state-by-state achievement testing program has been endorsed by the Council of 
Chief State School Officers (CCSSO) and the National Governors' Association 
(NGA) and is in progress using the National Assessment of Educational Progress 
(NAEP) Program. Connecticut participated in the 1990 trial state assessment 
for mathematics at grade eight. Results of this assessment were released June 
6, 1991, at a national press conference in Washington, D.C. In addition, 
Connecticut participated in the 1992 trial state assessment in grades four and 
eight. 



Norms Available to Districts 



Total mathematics, language arts, reading comprehension and mathematics 
problem solving norms can also be calculated for groups of students at the 
district level. Each year all districts are notified by the CMT contractor 
that norms for their own districts and schools within their districts are 
optionally available. In addition, districts are offered all materials and 
directions necessary to hand-calculate norms for groups of students within 
their districts (e.g., Chapter I students). There is no charge for either of 
these services. Any district that requests this information receives it 
directly from the CMT contractor. No district receives normative information 
unless it is specifically requested by the superintendent. Over one half of 
Connecticut school districts have requested norms in the past. 



Participation Rate Results 



Appendix L (p. 95) presents the number of fourth-grade students in each 
district and the percent of students who participated in the grade four 
mastery testing during the fall 1991 statewide administration. Appendix L 
also shows the percent of students exempted from CMT testing. The 
alphabetical listing of districts provides the following information for each 
district: 



Column 1 The name of the district 

Column 2 The total fourth-grade population at the start of mastery 
testing 

Column 3 The number of students eligible for testing 

Column 4 The percent of total population exempted from testing 

Columns 5-8 The percent of eligible students tested in each content area 
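
The rate columns in Appendix L can be derived from the count columns. The Python sketch below shows one way to do so for a single district. It assumes that exempted students are the difference between the total population and the eligible count, which the report does not state explicitly; the district figures are hypothetical.

```python
def exemption_percent(total_population: int, eligible: int) -> float:
    """Percent of the total fourth-grade population exempted from testing.

    Assumes exempted = total_population - eligible.
    """
    return round(100 * (total_population - eligible) / total_population, 1)


def participation_percent(tested: int, eligible: int) -> float:
    """Percent of eligible students tested in one content area."""
    return round(100 * tested / eligible, 1)


# Hypothetical district: 200 fourth graders, 180 eligible, 171 tested.
print(exemption_percent(200, 180))
print(participation_percent(171, 180))
```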



The results in Appendix L illustrate that participation rates by school 
district on the fourth-grade CMT were quite high, with only a few exceptions. 
However, the high percentage of students exempted from the CMT, statewide, 
combined with the large variation in exemption rates among districts, has 
raised concerns about the fair application of exemption procedures and its 
impact on students. The Department has examined the impact of the exclusion 
provisions on the CMT programs for Special Education and bilingual students. 
The results from these analyses are available from the Division of Research, 
Evaluation, and Assessment. 
Test Construction 

The development of the fourth-grade criterion-referenced mastery test required 
the formation of seven statewide advisory committees. These included the 
Mathematics and Language Arts Advisory Committees, the Psychometrics Advisory 
Committee, the Bias Advisory Committee, the Connecticut Student Assessment 
Advisory Committee (formerly the Mastery Test Implementation Advisory 
Committee), and two standard-setting committees, one for mathematics and one 
for language arts. These committees were composed of representatives from 
throughout the state. Members were selected for their area of expertise. 
Approximately 150 Connecticut educators participated on the mastery test 
committees which met over 80 times during the first 18 months of test 
development. (See Acknowledgements, p. v and page 44.) 

Beginning in the spring of 1984, content committees in both language arts and 
mathematics participated in each stage of the test development process, 
including assisting the State Department of Education in the selection of The 
Psychological Corporation as its test contractor. First, the content 
committees reviewed the curriculum materials prevalent throughout the state 
and the scope of the national tests in use in Connecticut at the respective 
grade levels. Additional resources included the Connecticut curriculum guides 
in mathematics and language arts, developed in 1981, as well as the results of 
recent Connecticut Assessment of Educational Progress (CAEP) assessments in 
mathematics and language arts. Next, the committees identified sets of 
preliminary mathematics and language arts objectives which reflected existing 
curriculum materials and the goals of the mastery testing program. The 
content committees defined an objective as an operationalized learning outcome 
that was fairly narrow and clearly defined. 

Four criteria were used in identifying the appropriate learning outcomes or 
test objectives and in selecting specific test items to be included on the 
Grade 4 Connecticut Mastery Test (CMT). To have been considered for use, test 
objectives and items must have been: 

(1) significant and important; 

(2) developmentally appropriate; 

(3) reasonable for most students to achieve; and 

(4) generally representative of what is taught in Connecticut schools. 

Once the objectives were identified, item specifications and/or sample items 
were written. Item specifications are written descriptions of the types and 
forms of test items that assess an objective. They also prescribe the types 
of answer choices that can be used with each item. 

After the test specifications were written and agreed upon, the test 
contractor wrote items and response choices for each of the objectives. The 
items were then reviewed by the content committees. Items which met the 
criteria of the test specifications and received the approval of the content 
committees were considered for the pilot test. Before testing, the Bias 
Advisory Committee reviewed each item for potential discrimination related to 
gender, race or ethnicity in the language or format of the question or 
response choices. After their review was completed, the pilot test forms were 
constructed. Over 500 customized Connecticut items were included in the 
October 1984 grade four pilot test in language arts and mathematics. 

The Psychometrics Advisory Committee provided advice concerning other aspects 
of the pilot test including the sampling design, statistical bias analysis, 
the design of item specifications and pilot test administration procedures. 
The recommendations proposed by the Psychometrics Advisory Committee were 
reviewed and endorsed by the Connecticut Student Assessment Advisory Committee. 



Pilot Tests 

After the items had been reviewed, twelve test forms (six in mathematics and 
six in language arts) were piloted for the grade four test. The purpose of 
several pilot test forms was to ensure that enough test items were included to 
construct three comparable test forms from the pilot test results. 

Over 6,000 grade four students participated in the October 1984 pilot test. 
In January 1985, the pilot test results were made available to Connecticut 
State Department of Education (CSDE) staff. The process of selecting items to 
construct three comparable test forms began with the Bias Advisory Committee's
examination of the pilot test statistics of each item for potential bias. As a
result, some items were eliminated from the item pool. From the remaining 
items, test forms were constructed to be equivalent in content and difficulty 
at both the objective and total test levels. 

Once the items were sorted on this basis, the test contractor prepared three 
complete forms of the mathematics test and two complete forms of the language 
arts test. These forms were approved by the content committees. Each form 
was created to be equal in difficulty and test length. A third language arts 
test was constructed after a few additional items were piloted as part of a 
later test administration. Later, during subsequent CMT administrations, 
enough items were pilot tested to yield two additional test forms. The 
psychometric procedures used to construct each of these test forms focused 
primarily on the use of the one-parameter item response model. 



Survey 

In October 1984, a survey of preliminary grade four mastery test objectives 
was sent to over 3,000 Connecticut educators. The purpose of the survey was 
to determine (1) the importance of the proposed mathematics and 
reading/language arts objectives and (2) whether the objectives were taught 
prior to the beginning of grade four. The response rate exceeded 50%, and
approximately one-third of the respondents represented urban
school districts. As a result of the survey, two objectives were not
considered to be important learning outcomes before fourth grade and 
consequently were eliminated from the fourth-grade language arts test by the 
Language Arts Advisory Committee. 

Grade Four Mathematics Objectives



The 25 objectives of the fourth-grade mathematics test are listed below. 
There are four test items for each objective. The number of items in each 
domain is indicated in the parentheses. 



CONCEPTUAL UNDERSTANDINGS (28) 

1. Identify the number one more, one less, ten more or ten less 
than a given number 

2. Extend patterns involving numbers and attributes 

3. Order whole numbers 

4. Rewrite numbers using expanded notation 

5. Rewrite numbers by regrouping tens and ones 

6. Identify fractional parts of regions and sets from pictures 
for halves, thirds, fourths and sixths 

7. Relate multiplication and division facts to rectangular arrays 

COMPUTATIONAL SKILLS (20) 

8. Know addition and subtraction facts to 18 

9. Add and subtract one- and two-digit numbers without regrouping

10. Add one- and two-digit numbers with regrouping

11. Estimate sums and differences to 100

12. Multiply and divide by 2, 5 and 10

PROBLEM SOLVING/APPLICATIONS (32) 

13. Identify objects or numbers that do or do not belong in a 
collection, matrix, or array 

14. Read and interpret bar graphs and pictographs 

15. Read and interpret data from tables and charts 

16. Identify or write number sentences from pictures 

17. Identify number sentences from addition or subtraction story 
problems 

18. Solve simple story problems involving addition or subtraction 

19. Solve and identify number sentences in simple story problems 
involving addition and subtraction, with extraneous 
information 

20. Identify needed information in problem situations

MEASUREMENT/GEOMETRY (20)

21. Measure length and identify appropriate units for measuring 
length and distance 

22. Estimate lengths and areas 

23. Tell time to the nearest hour, half hour and quarter hour, 
using analog and digital clocks 

24. Determine the value of a set of coins 

25. Identify shapes, angles and sides 



Performance on all 25 objectives is reported at the student, classroom, 
school, district and state levels. 
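
The item counts quoted above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the domain boundaries implied by the numbered list (objectives 1-7, 8-12, 13-20 and 21-25); the variable names are illustrative only and do not come from the report.

```python
# Illustrative check of the mathematics item counts quoted above.
# Domain boundaries follow the numbered list; all names are hypothetical.
ITEMS_PER_OBJECTIVE = 4

domains = {
    "Conceptual Understandings": range(1, 8),       # objectives 1-7
    "Computational Skills": range(8, 13),           # objectives 8-12
    "Problem Solving/Applications": range(13, 21),  # objectives 13-20
    "Measurement/Geometry": range(21, 26),          # objectives 21-25
}

# Items in a domain = number of objectives in it x four items per objective.
items_per_domain = {
    name: len(objectives) * ITEMS_PER_OBJECTIVE
    for name, objectives in domains.items()
}
total_items = sum(items_per_domain.values())

for name, count in items_per_domain.items():
    print(f"{name}: {count} items")
print(f"Total: {total_items} items")
```

Under these assumptions the per-domain counts agree with the figures in parentheses, and the total corresponds to a 100-item test.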

Grade Four Language Arts Objectives



There are nine multiple-choice objectives and two holistic measures, one for 
reading and one for writing, within the fourth-grade language arts test. The 
number of items for each content area or objective is indicated in the 
parentheses. 



WRITING MECHANICS (36) 

1. Capitalization and Punctuation (12) 

2. Spelling Words, Homonyms and Abbreviations (9) 

3. Agreement (15) 

LOCATING INFORMATION (11) 

4. Schedules, Maps, Table of Contents, Title Page 
and Dictionary (11) 

LISTENING COMPREHENSION (20) 

5. Literal (7) 

6. Inferential and Evaluative (13)

READING COMPREHENSION (36)

7. Literal (12) 

8. Inferential (14) 

9. Evaluative (10) 

DEGREES OF READING POWER (56) 

WRITING SAMPLE (1) 

Holistic scoring is provided for all students. Analytic scoring is 
provided for students who score at or below the remedial standard of 
4 (on a scale of 2-8). 



Performance on all nine Language Arts objectives, the Degrees of Reading Power 
and Writing Sample is reported at the student, classroom, school, district and 
state levels. 

Standard-Setting Committees
Remedial (Grant) Standard-Setting Process



Background 

There are several acceptable strategies for setting standards on 
criterion-referenced tests. Each of the proposed methods has one or more
unique characteristics. One common element to the various methods is that 
they all offer to the individuals who are setting the standards some process 
which reduces the arbitrariness of the resulting standard. Different methods 
accomplish this in different ways. All methods systematize the standard- 
setting process so that the result accurately reflects the collective informed 
judgment of those setting the standard. 



Types of Standard-Setting Methods 

Standard-setting methods can generally be categorized into three types: test 
question review, individual performance review and group performance review. 
Test question review methods specify a procedure for standard setters to 
examine each test question and make a judgment about that question. For 
example, standard setters might be asked to rate the difficulty or the 
importance of each question. These judgments are combined mathematically to 
produce a standard. Individual performance review methods also require 
standard setters to make judgments, but the judgments are made on the basis of 
examining data that indicate how well individual students perform on test 
items. These data may be based on actual pilot test results or projected 
results using mathematical theories. In this method, additional student
information, such as grades, may also be used to inform the standard setters. 
Group performance review methods provide for judgments to be made based on the 
performance of a reference group of students. That is, standard setters 
review the group performance and make a determination where the standard 
should be set based on the group results. 



Selection of a Standard-Setting Method 

Several factors affect the choice of a particular standard-setting method. 
The type of test is one consideration. For example, some methods are only 
appropriate for multiple-choice questions or for single correct answer
questions while other methods are more flexible. Time
constraints are also a consideration if student performance data are necessary.
In this case, a pilot test must be conducted and the test results must be 
analyzed prior to setting the standards. Another consideration is the 
relative importance of the decisions that will be made on the basis of the 
standard. For example, a classroom test affecting only a few students would 
not require as stringent a procedure as would a statewide test determining 
whether a student is allowed to graduate from high school. Other relevant 
factors include the number of test items, permanence of the standard, purpose 
of the test and the extent of available financial and other resources to 
support the standard-setting process. 



On February 4, 1985, the Mastery Test Psychometrics Advisory Committee met to
consider the issue of standard-setting procedures and voted unanimously to 
approve the following proposal. 



A PROPOSAL FOR SETTING THE REMEDIAL STANDARDS ON THE CONNECTICUT MASTERY TESTS 

1. Two standard-setting committees will be created: one for mathematics and 
one for reading and writing. 

2. This description of a minimally proficient student will be given to each 
of the committees: 

Imagine a student who is just proficient enough in reading, writing 
and mathematics to successfully participate in his/her regular 
fourth-grade coursework. 

3a. In mathematics, an adaptation of the Angoff procedure will be used. 

The committee will be provided with each item appearing on one form of the 
mathematics test. The committee will be given the following directions: 

Consider a group of 100 of these students who are just proficient 
enough to be successful in regular fourth-grade coursework. How many 
of them would be expected to correctly answer each of the questions? 

The committee will rate each item. The committee will then be given the 
opportunity to discuss their rating of each item. Sample pilot data will 
be presented. Committee members will be given the opportunity to adjust 
their item ratings. The item ratings will then be averaged in accordance 
with the Angoff procedure in order to produce a recommended test standard. 

b. In reading, the committee will review and discuss each passage of the 
Degrees of Reading Power (DRP) test. Student performance data will be 
presented. The committee will consider the reading difficulty that should 
be expected of a student at the grade level being tested. The committee 
members will identify the passage that has the appropriate level of 
reading difficulty consistent with the above description of a minimally 
proficient student. 

c. In writing, the committee will read four sample essays. These essays 
will have been prescored holistically (on a scale from 2 to 8) in order to 
rank the quality of the essays. Committee members will classify essays 
into one of three categories: 1) definitely NOT proficient, 2) borderline
and 3) definitely proficient. These classifications will be discussed in 
light of the holistic scores. The committee will then classify 
approximately twenty-five additional essays. The essay ratings will be 
discussed in the same manner as the original four essays. When all essays
have been discussed, the essays which fell in the borderline category will 
be focused upon to determine the standard. The committee will determine 
where, among the borderline essays, the standard should be established. 

4. The standards recommended in step 3 will be presented to the Connecticut 
Student Assessment Advisory Committee (formerly the Mastery Test 
Implementation Advisory Committee) for discussion and action. 

Connecticut's Strategy



Several steps were employed to create an acceptable and valid test standard 
for Connecticut tests. Initially, a separate standard-setting committee was 
convened for each test on which standards were to be set. Individuals were 
chosen to serve as members on the committee on the basis of their familiarity 
with the area being assessed and the nature of the examinees. One source of 
such members was the test content committees related to the project. For 
example, members of the Mathematics Advisory Committee were represented on the 
committee setting standards for the mathematics mastery test. 

The actual procedures used to set standards were an adaptation of a method 
proposed by William Angoff (1970). This test question review method required 
members of a standard-setting committee to estimate the probability that a 
question would be correctly answered by examinees who possess no more than the 
minimally acceptable knowledge or skill in the areas being assessed. Standard 
setters then reviewed pilot test data for sample items as further evidence of 
the appropriateness of the judgments being made. The original probability 
estimates assigned to each test question were reviewed and adjustments made by 
the standard setters. The final individual item probabilities were summed to 
yield a suggested test standard for each member of the committee. The 
suggested standards were averaged across members of the committee to produce 
the recommended test standard. 
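
The arithmetic described above (sum each member's item probabilities, then average across members) can be sketched as follows. This is only an illustration under stated assumptions: the function name, the judge labels and the ratings are hypothetical and are not data from the standard-setting sessions.

```python
from statistics import mean

def angoff_standard(ratings):
    """Return (recommended_standard, per_judge_standards).

    `ratings` maps each judge to a list of probabilities (0.0-1.0), one per
    test item, that a minimally proficient examinee answers the item
    correctly. Names and structure are assumptions for illustration.
    """
    # Each judge's suggested standard is the sum of that judge's item
    # probabilities, i.e. the expected raw score of a borderline examinee.
    per_judge = {judge: sum(probs) for judge, probs in ratings.items()}
    # The recommended standard is the average of the judges' standards.
    return mean(list(per_judge.values())), per_judge

# Hypothetical ratings for a four-item test by three judges (not real data).
example = {
    "judge_a": [0.50, 0.75, 0.50, 0.25],
    "judge_b": [0.75, 0.75, 0.50, 0.50],
    "judge_c": [0.50, 0.50, 0.25, 0.25],
}
standard, per_judge = angoff_standard(example)
print(per_judge)
print(standard)
```

On a 100-item test, a standard computed this way can be read either as a raw score or as a percentage of items correct, since the two scales coincide.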

The recommended test standard was presented to the Connecticut Student 
Assessment Advisory Committee and the State Board of Education. 

In mid-March 1985, Mathematics and Language Arts Standard-Setting Committees 
met to set the remedial standards for the Grade 4 Mastery Test. The following 
information summarizes the results of the standard-setting activities
conducted by CSDE staff: 

I. Mathematics (100-item test) 

Using the procedures previously outlined, the standard setters rated each item 
and considered the pilot data. Committee members discussed items and were 
given the opportunity to adjust their initial ratings. The final ratings were 
averaged to produce a remedial standard. It was recommended that a raw score 
of 69 be the remedial mathematics standard. Below is a summary of the ratings. 

Procedure    # Judges    Range (%)    Mean % Correct    Raw Score

Angoff       21          56.7-81.3    68.7              68.7

II. Reading (Degrees of Reading Power, 56-item test) 

Standard setters used two procedures to establish a remedial reading 
standard. First, they examined the passages in the Degrees of Reading Power 
(DRP) test, asking themselves which passage is too difficult for the student 
who is just proficient enough to successfully participate in fourth-grade 
coursework. Discussion occurred throughout this selection process. 

Second, they examined textbooks which are typically used in grades three and
four and selected those textbooks which a minimally proficient student would 
not be expected to read in order to successfully participate in fourth-grade 
coursework. Discussion occurred throughout this selection process. 

The average readability values of the selected passages and textbooks and the 
pilot test data were then revealed to the standard setters. The standard 
setters discussed the readability values and the pilot test data and 
recommended the DRP unit score of 41 as the remedial standard. This standard 
was accepted by the State Board of Education at the 70% comprehension level. 
Below is a summary of the ratings.

                                        Readability        Recommended
Procedure                  # Judges     Range              Remedial Standard

A. Test Passage Review     17           42-48 DRP Units    41 DRP Units

B. Textbook Review         17           42-51 DRP Units    41 DRP Units

III. Writing (45-minute writing sample)

Using the procedure previously outlined, standard setters read and rated 21 
essays written to a narrative prompt and 21 essays written to an expository 
prompt. After discussions and final ratings, the holistic scores for the 
papers were revealed to the group. The committee then discussed the 
appropriate remedial writing standard in light of the degree to which their 
ratings matched the holistic scores. It was the recommendation of the 
committee that a holistic writing score of 4 be used as the remedial writing 
standard. Below is a summary of the ratings. 



[Table: NARRATIVE PROMPT, Rating After Discussion by Holistic Score
(Definitely NOT Proficient through Definitely Proficient)]

Standard-Setting Committees



LANGUAGE ARTS STANDARD-SETTING COMMITTEE 

Evelyn P. Burnham, Colebrook Public Schools 
Nicholas P. Criscuolo, New Haven Public Schools 
Mary R. Fisher, Thompson Public Schools
Marguerite Fuller, Bridgeport Public Schools
Anne Jackel, Thompson Public Schools
Dorothy Kaplan, Middletown Public Schools
Robert Kinder, CT State Department of Education
Bob Lincoln, Tolland Public Schools
Virginia Lity, Bridgeport Public Schools 
Virginia Manulls, Colebrook Public Schools 
Noreen McDermott, Hartford Public Schools 
Elizabeth Nelligan, Canton Public Schools 
Dorothy Nevers, Canton Public Schools 
Carol D. Parmelee, Middletown Public Schools 
Beverly R. Peterman, Stamford Public Schools 
Geraldine Smith, Canton Public Schools 
Mary Weinland, CT State Department of Education



MATHEMATICS STANDARD-SETTING COMMITTEE

Betsy Andersen, Manchester, Connecticut 
Betsy Carter, CT State Department of Education 
Geraldine M. Cemprola, Ridgefield Public Schools
Linda Cherry, Suffield Public Schools
Elizabeth B. Cubeta, Middletown Public Schools 
Corretta K. Dean, Bridgeport Public Schools 
Tony Ditrio, Norwalk Public Schools 
Anita Gaston, Bloomfield Public Schools 
Janet Heintz, Farmington Public Schools 
Mary Anna Keough, Meriden Public Schools 
Steven Leinwand, CT State Department of Education 
Wesley Masten, Norwalk Public Schools 
Irene B. Moriarty, Meriden Public Schools 
Pamela Munro, Windham Public Schools 
Eileen O'Reilly, Manchester Public Schools 
Lois Piper, Norwalk Public Schools 
Twila Pollard, New Haven Public Schools 
Rosemary Powers, Bloomfield Public Schools 
Sylvia Webb, Middletown Public Schools 
George A. Wells, New Haven Public Schools 
Frank K. Whittaker, Bridgeport Public Schools 

APPENDIX E
Grade Four Overview of Holistic Scoring 

and 

Marker Papers for Holistic Scoring 



An Overview of Holistic Scoring 



Description of the Method 

Holistic scoring involves judging a writing sample for its total effect. 
The scorer makes an overall evaluation taking into account all characteristics 
which distinguish good writing. No one feature (such as spelling, rhetoric, 
or organization) should be weighted to the exclusion of all other features. 
Contributing to the rationale underlying holistic scoring is evidence that: 

• no aspect of writing can be judged independently and result in an
overall score of quality;

• teachers can recognize and concur upon good writing samples; and

• teachers tend to rank entire pieces of writing in the same way,
regardless of the importance they might attach to the particular
components of writing.

The scoring scale for holistic scoring is determined by the quality of the 
specific samples being evaluated. That is, the success of a particular 
response is determined in relationship to the range of ability reflected in 
the set of writing samples being assessed. 

Preparation for Scoring 

Prior to the training/scoring sessions, a committee consisting of Connecticut 
State Department of Education (CSDE) consultants, representatives of the 
Language Arts Advisory Committee and other language arts specialists from 
throughout the state, two chief readers and a project director from 
Measurement Inc. of Durham, North Carolina and a reading specialist from The 
Psychological Corporation met and read a substantial number of essays drawn 
from the total pool of essays to be scored. Approximately 60 essays were 
selected to serve as "range-finders" or "marker papers" representing the range 
of achievement demonstrated in the total set of papers. Copies of those 
range-finders served as training papers during the scoring workshops which 
followed. Each range-finder paper was assigned a score according to a 
four-point scale, where 1 represented a poor paper and 4 represented a 
superior paper. 

Scoring Workshops 

During the month of November, several holistic scoring workshops were held in 
various locations throughout the state. Attendance at the grade four scoring 
workshops totaled 271 teachers. A chief reader and two assistants were 
present at every workshop in addition to representatives of the CSDE. Each 
workshop consisted of a training session and a scoring session. 

Training and Qualifying 

• All teachers were shown approximately fourteen range-finder papers.
The chief reader discussed each paper and explained the reason why
each received its score.

• All teachers were given a six-paper practice set. They scored the
papers independently and recorded the scores on their papers. When
all teachers were finished, the chief reader discussed each paper and
explained why each received its score.

• All teachers were given a nine-paper training set. They scored the
papers independently, based on an overall impression, and recorded
their scores on a monitor sheet as well as on their papers. As they
finished reading and scoring, they brought the monitor sheet to the
team leader who checked the scores. When all teachers were finished
and all monitor sheets were checked, the chief reader discussed the
nine-paper set.

• Regardless of whether or not they qualified on the first training
set, all teachers were then given another nine-paper training set.
They scored the papers and had the monitor sheets checked. Set Two
was not discussed, except with non-qualifiers.

• Teachers were considered qualified if they scored six or more papers
correctly on either set. Teachers who met the standard began scoring
actual test papers after Set Two.

• If any teacher did not qualify, they received additional training by
one of the team leaders or by the chief reader away from the scoring
room. They had two more opportunities to qualify. Any teacher who
failed to qualify would have been excused from the project and paid
for one day.
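
The qualifying rule for the two nine-paper training sets can be expressed as a small check. This is a minimal sketch, assuming that "scored correctly" means the teacher's score matched the score already assigned to the range-finder paper; the function names and the sample scores are hypothetical.

```python
def count_matches(assigned, key):
    """Count papers where the scorer's score equals the range-finder score."""
    return sum(1 for a, k in zip(assigned, key) if a == k)

def is_qualified(set_one, set_two, key_one, key_two, required=6):
    """A scorer qualifies with `required` or more matches on either set."""
    return (count_matches(set_one, key_one) >= required
            or count_matches(set_two, key_two) >= required)

# Hypothetical nine-paper sets on the 1-4 scale (not real data).
key = [1, 2, 3, 4, 2, 3, 1, 4, 2]
first_try = [1, 2, 2, 4, 3, 3, 2, 4, 1]   # matches the key on 5 papers
second_try = [1, 2, 3, 4, 2, 3, 2, 3, 2]  # matches the key on 7 papers

print(is_qualified(first_try, second_try, key, key))
```

In this example the scorer misses the threshold on the first set but qualifies on the second, which is sufficient because either set counts.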

The Scoring Session 

Once scorers qualified, actual scoring of the writing exercises began
according to the steps outlined below: 

• Scorers read each paper once carefully but quickly and designated a
score. Again, the score reflected the scorer's overall impression of
the response as it corresponded with the features of written
composition which were internalized during the training process.

• Each paper was read and scored by a second scorer independently of
the first, that is, without seeing the score assigned by the first
reader. The chief reader had the responsibility of adjudicating any
disagreement of more than one point between the judgments of the
first two scorers. In other words, adjacent scores (e.g., awarded
scores of 4 and 3, 1 and 2, 2 and 3) were acceptable, but larger
discrepancies (e.g., scores of 2 and 4, 3 and 1, 1 and 4) were
resolved by the chief reader. In general, with successful training,
the occurrence of large score discrepancies is rare.

• The two scores for each paper were added to produce the final score
for each student, resulting in scores between 2 and 8.
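
The scoring steps above can be sketched in a few lines. This is a hedged illustration, not the procedure's actual implementation: the report does not say how the chief reader resolved a discrepancy, so the `adjudicate` callback below is a hypothetical stand-in that returns the resolved pair of scores.

```python
def final_score(first, second, adjudicate=None):
    """Combine two independent holistic scores (each 1-4) into a 2-8 total.

    If the two scores differ by more than one point, `adjudicate` (a
    hypothetical callback standing in for the chief reader) must return
    the resolved pair of scores before they are summed.
    """
    for score in (first, second):
        if score not in (1, 2, 3, 4):
            raise ValueError("holistic scores must be 1, 2, 3 or 4")
    if abs(first - second) > 1:
        if adjudicate is None:
            raise ValueError("discrepant scores need chief-reader resolution")
        first, second = adjudicate(first, second)
    return first + second

print(final_score(4, 3))   # adjacent scores are simply added
print(final_score(1, 4, adjudicate=lambda a, b: (3, 3)))  # resolved first
```

Summing the two readings is what stretches the four-point scale to the 2 to 8 range reported for each student.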

Understanding the Holistic Scores



Examples of actual student papers which are representative of the scoring 
range will assist the reader in understanding the statewide standard set for 
writing and interpreting the test results. Sample papers representing four
different holistic scores are presented on the following pages. Note that the 
process of summing the scores assigned by the two readers expands the scoring 
scale to account for "borderline" papers. A paper which receives a 4 from 
both scorers (for a total score of 8) is likely to be better than a paper to 
which one reader assigns a 4 and another reader assigns a 3 (for a total score 
of 7). In addition, it should be emphasized that each of the score points 
represents a range of student papers—some 4 papers are better than others. 

A score of Not Scorable (NS) was assigned to student papers in certain cases. 
A score of NS indicates that the student's writing skills remain to be 
assessed. The cases in which a score of NS was assigned were as follows: 

• responses that merely repeated the assignment;

• illegible responses;

• responses in languages other than English;

• responses that failed to address the assigned topic in any way; and

• responses that were too brief to score accurately, but which
demonstrated no signs of serious writing problems (for example, a
response by a student who wrote the essay first on scratch paper and
who failed to get very much of it copied).

Both readers had to agree that a paper deserved a NS before this score was 
assigned. If the two readers disagreed, the chief reader arbitrated the 
discrepancy. Papers which were assigned a score of NS were not included in 
summary reports of test results. 

Summary Comments 

The fact that standards must be maintained and reinforced throughout a scoring 
session cannot be overemphasized. Holistic scoring depends for its usefulness 
on consistency of scoring among all scorers throughout the sessions. 

CONNECTICUT MASTERY TEST
1991 Grade Four 
Writing Assignment 



One day you meet a creature from outer space. You are the only one who can 
see it. 

Write a story telling your classmates about your adventure with the creature 
from outer space. 

• Tell what the creature looked like and how it acted. 

• Write a story about what happened when you met it. 

APPENDIX F
Grade Four Analytic Rating Guide 
and 

Marker Papers for Analytic Scoring 

Grade Four Analytic Rating Guide

FOCUS: How effectively does the writer unify the paper by a dominant topic?

1 = switches and/or drifts frequently from the dominant topic

2 = switches and/or drifts somewhat from the dominant topic

3 = stays on topic throughout the response

ORGANIZATION: Is there a plan that clearly governs the sequence from the
beginning to the end of the response, and is the plan effectively signaled?

1 = no discernible plan

2 = inferable plan and/or discernible sequence; some signals may be
    present

3 = controlled, logical sequence with a clear plan

SUPPORT/ELABORATION: To what extent is the narrative developed by details
that describe and explain the narrative elements (character, action and
setting)?

1 = vague or sketchy details that add little to the clarity of the
    response, or specific details but too few to be called list-like

2 = details that are clear and specific but are list-like, or uneven, or
    not developed

3 = somewhat developed details that enhance the clarity of the response

CONVENTIONS: To what extent does the student use the conventions of
standard written English (e.g., sentence formation, spelling, usage,
capitalization, punctuation)?

1 = many errors

2 = some errors

3 = few errors

APPENDIX G

Sample Grade Four Mastery Test Score Reports 

• Class Diagnostic Report

- Mathematics

• School by Class Report

- Mathematics

• Class Diagnostic Report

- Language Arts

• School by Class Report

- Language Arts

• District by School Report

- Language Arts

• Parent/Student Diagnostic Report

APPENDIX H
Fall 1991 Grade Four

APPENDIX J
Type of Community Classifications 



Type of Community 



TOC 1 - LARGE CITY - a town with a population of more than 100,000. 

TOC 2 - FRINGE CITY - a town contiguous with a large city and with a
population over 10,000. 

TOC 3 - MEDIUM CITY - a town with a population between 25,000 and 100,000 and 
not a Fringe City. 

TOC 4 - SMALL TOWN (Suburban) - a town within an SMSA* with a population of
less than 25,000, not a Fringe City. 

TOC 5 - SMALL TOWN (Emerging Suburban) - a town with a population of less than 
25,000 included in what was a proposed 1980 SMSA but not included in a 
1970 SMSA. 

TOC 6 - SMALL TOWN (Rural) - a town not included in an SMSA, with a population 
of less than 25,000. 



*Standard Metropolitan Statistical Area 
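
The classification rules above can be read as an ordered series of checks. The sketch below is illustrative only and rests on assumptions not stated in the source: that the rules are applied in order from TOC 1 to TOC 6, that "within an SMSA" for TOC 4 refers to a 1970 SMSA, and that a town of exactly 25,000 or 100,000 counts as a Medium City. The function and parameter names are hypothetical.

```python
def type_of_community(population, contiguous_with_large_city=False,
                      in_1970_smsa=False, in_proposed_1980_smsa=False):
    """Return the TOC code (1-6) for a town under the assumed rule order."""
    if population > 100_000:
        return 1  # Large City
    if contiguous_with_large_city and population > 10_000:
        return 2  # Fringe City
    if population >= 25_000:
        return 3  # Medium City (boundary handling is an assumption)
    if in_1970_smsa:
        return 4  # Small Town (Suburban)
    if in_proposed_1980_smsa:
        return 5  # Small Town (Emerging Suburban)
    return 6      # Small Town (Rural)

# Hypothetical towns, not taken from the report.
print(type_of_community(150_000))
print(type_of_community(30_000, contiguous_with_large_city=True))
print(type_of_community(12_000, in_proposed_1980_smsa=True))
```

Checking the Fringe City rule before the population bands keeps a mid-sized town next to a large city out of the Medium City class, which matches the "not a Fringe City" wording.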

Education Reference Group Descriptions



The education reference groups were formed from an analysis of districts'
median family income, percentage of high school graduates, percentage of
those in managerial/professional occupations, percentage of single-parent
families, percentage of those below poverty and percentage of non-English
home language from the 1980 census. The groups have not been named, but have
been labeled I through VII. Note, however, that the groups run from extremely 
affluent suburban communities (I) to our three largest cities of Hartford, 
Bridgeport and New Haven (VII). Some differ widely with respect to all of the 
family background variables; others differ slightly with respect to one or 
two. In addition to the six variables used to classify districts, the group 
descriptions below also include superintendents' comments that were provided 
in a Department survey in 1988. 

Group I. These 13 districts were wealthy, professional suburbs. The median 
family income in 1979 averaged $40,425. Residents were extremely well 
educated. Nearly 90% had at least a high school diploma, 42% had a bachelor's
degree and 49% had a managerial or professional job. There were relatively
few children with educational disadvantages here. Only 7% of the families
were single-parent, about 8% spoke a language other than English at home and
almost no one (2%) lived in poverty. Superintendents within these towns used
the adjectives "suburban," "affluent," "growing" and "bedroom community" to 
describe them. 

Group II. Residents in the 29 districts of Group II were affluent, 
well-educated professionals, but to a lesser extent than residents of 
Group I. The median family income averaged $28,113, more than 83% of the 
residents had high school diplomas, 29% had a college degree and 36% had a 
managerial or professional job. Like Group I, this group had a low percentage 
of people who spoke another language at home (8%), almost no one in poverty 
(2%) and relatively few single-parent families (9%). Like the superintendents 
in Group I, superintendents from these towns described their communities as 
"affluent," "bedroom communities," "growing" and "suburban." 

Group III. These 34 districts were mostly rural bedroom communities. Like 
Groups I and II, these towns did not have many disadvantaged children. There 
were only 7% who spoke a language other than English at home, only 7% who were 
from single-parent families and only 3% who were poor. Adults were slightly 
less affluent (median family income of $24,431), less likely to have a high 
school diploma (77%) and less likely to have a managerial or professional job 
(28%) than people in Group II. Like the previous two groups, these towns were 
described by superintendents as "suburban," "growing" and "bedroom 
communities." Several superintendents used "rural" and "middle class" (as 
well as "affluent") to describe their communities. 

Group IV. This group of 37 districts was probably the most diverse set of
towns, containing a number of coastal and resort communities, as well as rural
and suburban areas. Group IV was similar to Group III in median family
income ($22,609), percentage of high school graduates (77%), percentage of
managers/professionals (29%) and percentage of non-English home language (7%),
but had a significantly higher percentage of single-parent families
(12% versus 7%) and a slightly higher percentage of families below poverty
(5% versus 3%). Superintendents' descriptions reflect this group's
diversity. They describe their towns as "bedroom," "growing," "rural,"
"suburban," "middle income" and "affluent."

Group V. These 30 districts made up the first group of working class/blue 
collar communities. This group had a significantly lower percentage of high 
school graduates (68%) and percentage of managers/professionals (19%) than 
Group IV. Other characteristics were similar to Group IV: the average income
was $21,920, there were 11% single-parent families, 5% below poverty and 9% of
the population spoke a language other than English at home. 

Group VI. This group of 23 districts included the state's medium-sized 
cities, the larger cities of Stamford and Waterbury, several former mill towns 
and some densely populated blue collar suburbs. Group VI had
socioeconomic characteristics similar to Group V's, but significantly greater
proportions of single-parent families and families in which English was not 
the primary home language. The median family income of $20,325 was below the 
state average. An average of 16% of the residents spoke another language at 
home and 17% of the families were headed by single parents. Only 63% of the 
residents had high school diplomas, and 6% lived below poverty level. 

Group VII. Hartford, Bridgeport and New Haven were vastly different from 
other communities in Connecticut. An average of 28% of the families spoke a
language other than English, 46% were headed by single parents, 20% lived in
poverty and the median family income was $15,240.